Search CORE

602 research outputs found

Antennal transcriptome profiles of anopheline mosquitoes reveal human host olfactory specialization in Anopheles gambiae

Author: AGC Consortium
Pitts RJ
Rinker DC
Rokas A
Zhou X
Zwiebel LJ
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 22/10/2013
Field of study

BACKGROUND: Two sibling members of the Anopheles gambiae species complex display notable differences in female blood meal preferences. An. gambiae s.s. has a well-documented preference for feeding upon human hosts, whereas An. quadriannulatus feeds on vertebrate/mammalian hosts, with only opportunistic feeding upon humans. Because mosquito host-seeking behaviors are largely driven by the sensory modality of olfaction, we hypothesized that hallmarks of these divergent host seeking phenotypes will be in evidence within the transcriptome profiles of the antennae, the mosquito's principal chemosensory appendage. RESULTS: To test this hypothesis, we have sequenced antennal mRNA of non-bloodfed females from each species and observed a number of distinct quantitative and qualitative differences in their chemosensory gene repertoires. In both species, these gene families show higher rates of sequence polymorphisms than the overall rates in their respective transcriptomes, with potentially important divergences between the two species. Moreover, quantitative differences in odorant receptor transcript abundances have been used to model potential distinctions in volatile odor receptivity between the two sibling species of anophelines. CONCLUSION: This analysis suggests that the anthropophagic behavior of An. gambiae s.s. reflects the differential distribution of olfactory receptors in the antenna, likely resulting from a co-option and refinement of molecular components common to both species. This study improves our understanding of the molecular evolution of chemoreceptors in closely related anophelines and suggests possible mechanisms that underlie the behavioral distinctions in host seeking that, in part, account for the differential vectorial capacity of these mosquitoes

Springer - Publisher Connector

PubMed Central

Spiral - Imperial College Digital Repository

Bayesian Inference of Species Trees from Multilocus Data

Author: A. J. Drummond
Degnan
Drummond
Edwards
Felsenstein
Gernhard
Glor
Griffiths
Heled
Huelsenbeck
J. Heled
Kuhner
Liu
Pamilo
Rokas
Wu
Publication venue: Oxford University Press
Publication date: 26/09/2013
Field of study

Until recently, it has been common practice for a phylogenetic analysis to use a single gene sequence from a single individual organism as a proxy for an entire species. With technological advances, it is now becoming more common to collect data sets containing multiple gene loci and multiple individuals per species. These data sets often reveal the need to directly model intraspecies polymorphism and incomplete lineage sorting in phylogenetic estimation procedures

CiteSeerX

Crossref

PubMed Central

Recommended from our members

Future-Proofing Your Microbiology Resource Announcements Genome Assembly for Reproducibility and Clarity.

Author: Baltrus David A
Cuomo Christina A
Dennehy John J
Dunning Hotopp Julie C
Maresca Julia A
Newton Irene LG
Rasko David A
Rokas Antonis
Roux Simon
Stajich Jason E
Publication venue: eScholarship, University of California
Publication date: 05/09/2019
Field of study

Descriptions of resources, like the genome assemblies reported in Microbiology Resource Announcements, are often frozen at their time of publication, yet they will need to be interpreted in the midst of continually evolving technologies. It is therefore important to ensure that researchers accessing published resources have access to all of the information required to repeat, interpret, and extend these original analyses. Here, we provide a set of suggestions to help make certain that published resources remain useful and repeatable for the foreseeable future

eScholarship - University of California

A complementary view on the growth of directory trees

Author: A. Rokas
C. Dupuis
C. J. Tessone
D. Garlaschelli
D. Garlaschelli
D. Knuth
D.A. Huffman
E. Codd
E. Weibel
E.A. Herrada
F. Schweitzer
J. Cracraft
J.R. Banavar
K. Klemm
K. Klemm
L. Muchnik
M. M. Geipel
M. Zamir
P. Prusinkiewicz
P.L. Krapivsky
P.L. Krapivsky
P.L. Krapivsky
S. Dorogovtsev
S. Golder
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2009
Field of study

Trees are a special sub-class of networks with unique properties, such as the level distribution which has often been overlooked. We analyse a general tree growth model proposed by Klemm {\em et. al.} (2005) to explain the growth of user-generated directory structures in computers. The model has a single parameter

q

which interpolates between preferential attachment and random growth. Our analysis results in three contributions: First, we propose a more efficient estimation method for

q

based on the degree distribution, which is one specific representation of the model. Next, we introduce the concept of a level distribution and analytically solve the model for this representation. This allows for an alternative and independent measure of

q

. We argue that, to capture real growth processes, the

q

estimations from the degree and the level distributions should coincide. Thus, we finally apply both representations to validate the model with synthetically generated tree structures, as well as with collected data of user directories. In the case of real directory structures, we show that

q

measured from the level distribution are incompatible with

q

measured from the degree distribution. In contrast to this, we find perfect agreement in the case of simulated data. Thus, we conclude that the model is an incomplete description of the growth of real directory structures as it fails to reproduce the level distribution. This insight can be generalised to point out the importance of the level distribution for modeling tree growth.Comment: 16 pages, 7 figure

arXiv.org e-Print Archive

Repository for Publications and Research Data

Crossref

EDP Sciences OAI-PMH repository (1.2.0)

Research Papers in Economics

Why highly expressed proteins evolve slowly

Author: Akashi
Akashi
Akashi
Bloom
Bucciantini
C. Adami
C. O. Wilke
Cho
Coghlan
D. A. Drummond
Dong
Duret
Ellis
F. H. Arnold
Fraser
Ghaemmaghami
Goldberg
Greenbaum
Gu
Herbeck
Hirsh
Holstege
Hurst
J. D. Bloom
Kellis
Kellis
Kurtzman
Marais
Pal
Pal
Parker
Precup
P l
Rokas
Seoighe
Sharp
Sharp
Spreitzer
Subramanian
Wall
Yang
Zuckerkandl
Publication venue: 'Proceedings of the National Academy of Sciences'
Publication date: 12/08/2005
Field of study

Much recent work has explored molecular and population-genetic constraints on the rate of protein sequence evolution. The best predictor of evolutionary rate is expression level, for reasons which have remained unexplained. Here, we hypothesize that selection to reduce the burden of protein misfolding will favor protein sequences with increased robustness to translational missense errors. Pressure for translational robustness increases with expression level and constrains sequence evolution. Using several sequenced yeast genomes, global expression and protein abundance data, and sets of paralogs traceable to an ancient whole-genome duplication in yeast, we rule out several confounding effects and show that expression level explains roughly half the variation in Saccharomyces cerevisiae protein evolutionary rates. We examine causes for expression's dominant role and find that genome-wide tests favor the translational robustness explanation over existing hypotheses that invoke constraints on function or translational efficiency. Our results suggest that proteins evolve at rates largely unrelated to their functions, and can explain why highly expressed proteins evolve slowly across the tree of life.Comment: 40 pages, 3 figures, with supporting informatio

arXiv.org e-Print Archive

Crossref

PubMed Central

Caltech Authors

Horizontal Transfer and Death of a Fungal Secondary Metabolic Gene Cluster

Author: Altschul
Amselem
Antonis Rokas
Capella-Gutierrez
Darling
Hittinger
Hittinger
James
Jason C. Slot
Katoh
Keller
Keller
Khaldi
Khaldi
Limon
Ma
Martchenko
Matthew A. Campbell
Patron
Price
Rokas
Shimodaira
Slot
Slot
Slot
Stamatakis
Suyama
Wiemann
Wohlbach
Yang
Publication venue: Oxford University Press
Publication date
Field of study

A cluster composed of four structural and two regulatory genes found in several species of the fungal genus Fusarium (class Sordariomycetes) is responsible for the production of the red pigment bikaverin. We discovered that the unrelated fungus Botrytis cinerea (class Leotiomycetes) contains a cluster of five genes that is highly similar in sequence and gene order to the Fusarium bikaverin cluster. Synteny conservation, nucleotide composition, and phylogenetic analyses of the cluster genes indicate that the B. cinerea cluster was acquired via horizontal transfer from a Fusarium donor. Upon or subsequent to the transfer, the B. cinerea gene cluster became inactivated; one of the four structural genes is missing, two others are pseudogenes, and the fourth structural gene shows an accelerated rate of nonsynonymous substitutions along the B. cinerea lineage, consistent with relaxation of selective constraints. Interestingly, the bik4 regulatory gene is still intact and presumably functional, whereas bik5, which is a pathway-specific regulator, also shows a mild but significant acceleration of evolutionary rate along the B. cinerea lineage. This selective preservation of the bik4 regulator suggests that its conservation is due to its likely involvement in other non–bikaverin-related biological processes in B. cinerea. Thus, in addition to novel metabolism, horizontal transfer of wholesale metabolic gene clusters might also be contributing novel regulation

Crossref

PubMed Central

The Awesome Power of Yeast Evolutionary Genetics: New Genome Sequences and Strain Resources for the Saccharomyces sensu stricto Genus

Author: Dunham Maitreya J.
Eisen Michael B.
Hittinger Chris Todd
Johnston Mark
Payen Celia
Rine Jasper
Rokas Antonis
Scannell Devin R.
Zill Oliver A.
Publication venue: Genetics Society of America
Publication date: 01/01/2011
Field of study

High-quality, well-annotated genome sequences and standardized laboratory strains fuel experimental and evolutionary research. We present improved genome sequences of three species of Saccharomyces sensu stricto yeasts: S. bayanus var. uvarum (CBS 7001), S. kudriavzevii (IFO 1802T and ZP 591), and S. mikatae (IFO 1815T), and describe their comparison to the genomes of S. cerevisiae and S. paradoxus. The new sequences, derived by assembling millions of short DNA sequence reads together with previously published Sanger shotgun reads, have vastly greater long-range continuity and far fewer gaps than the previously available genome sequences. New gene predictions defined a set of 5261 protein-coding orthologs across the five most commonly studied Saccharomyces yeasts, enabling a re-examination of the tempo and mode of yeast gene evolution and improved inferences of species-specific gains and losses. To facilitate experimental investigations, we generated genetically marked, stable haploid strains for all three of these Saccharomyces species. These nearly complete genome sequences and the collection of genetically marked strains provide a valuable toolset for comparative studies of gene function, metabolism, and evolution, and render Saccharomyces sensu stricto the most experimentally tractable model genus. These resources are freely available and accessible through www.SaccharomycesSensuStricto.org

CiteSeerX

Crossref

PubMed Central

Digital Commons@Becker

eScholarship - University of California

Functional significance may underlie the taxonomic utility of single amino acid substitutions in conserved proteins

Author: A Rokas
A Stechmann
D Bryant
D Schlieper
DH Huson
DK Fygenson
E Nogales
ET Steenkamp
F Burki
Gerd K. Wagner
H Aldaz
J Lowe
JD Hackett
JH Nettles
JM Hush
Katharina T. Huber
Kevin M. Tyler
KH Downing
M Delattre
M Hari
M Muraoka
PJ Keeling
PJ Keeling
PJ Keeling
Qiong Wu
RB Ravelli
RC Moore
S Douglas
SF Altschul
SL Shaw
VP Edgcomb
Y Peer Van de
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2010
Field of study

We hypothesized that some amino acid substitutions in conserved proteins that are strongly fixed by critical functional roles would show lineage-specific distributions. As an example of an archetypal conserved eukaryotic protein we considered the active site of ß-tubulin. Our analysis identified one amino acid substitution—ß-tubulin F224—which was highly lineage specific. Investigation of ß-tubulin for other phylogenetically restricted amino acids identified several with apparent specificity for well-defined phylogenetic groups. Intriguingly, none showed specificity for “supergroups” other than the unikonts. To understand why, we analysed the ß-tubulin Neighbor-Net and demonstrated a fundamental division between core ß-tubulins (plant-like) and divergent ß-tubulins (animal and fungal). F224 was almost completely restricted to the core ß-tubulins, while divergent ß-tubulins possessed Y224. Thus, our specific example offers insight into the restrictions associated with the co-evolution of ß-tubulin during the radiation of eukaryotes, underlining a fundamental dichotomy between F-type, core ß-tubulins and Y-type, divergent ß-tubulins. More broadly our study provides proof of principle for the taxonomic utility of critical amino acids in the active sites of conserved proteins

Crossref

Springer - Publisher Connector

PubMed Central

University of East Anglia digital repository

TaxMan: a taxonomic database manager

Author: A Rokas
C Lee
D Gordon
DA Benson
H Philippe
JD Thompson
JE Stajich
M Jones
Mark Blaxter
Martin Jones
PC Feijao
SA Olson
SF Altschul
W Ludwig
Publication venue: BioMed Central
Publication date: 01/01/2006
Field of study

BACKGROUND: Phylogenetic analysis of large, multiple-gene datasets, assembled from public sequence databases, is rapidly becoming a popular way to approach difficult phylogenetic problems. Supermatrices (concatenated multiple sequence alignments of multiple genes) can yield more phylogenetic signal than individual genes. However, manually assembling such datasets for a large taxonomic group is time-consuming and error-prone. Additionally, sequence curation, alignment and assessment of the results of phylogenetic analysis are made particularly difficult by the potential for a given gene in a given species to be unrepresented, or to be represented by multiple or partial sequences. We have developed a software package, TaxMan, that largely automates the processes of sequence acquisition, consensus building, alignment and taxon selection to facilitate this type of phylogenetic study. RESULTS: TaxMan uses freely available tools to allow rapid assembly, storage and analysis of large, aligned DNA and protein sequence datasets for user-defined sets of species and genes. The user provides GenBank format files and a list of gene names and synonyms for the loci to analyse. Sequences are extracted from the GenBank files on the basis of annotation and sequence similarity. Consensus sequences are built automatically. Alignment is carried out (where possible, at the protein level) and aligned sequences are stored in a database. TaxMan can automatically determine the best subset of taxa to examine phylogeny at a given taxonomic level. By using the stored aligned sequences, large concatenated multiple sequence alignments can be generated rapidly for a subset and output in analysis-ready file formats. Trees resulting from phylogenetic analysis can be stored and compared with a reference taxonomy. CONCLUSION: TaxMan allows rapid automated assembly of a multigene datasets of aligned sequences for large taxonomic groups. By extracting sequences on the basis of both annotation and BLAST similarity, it ensures that all available sequence data can be brought to bear on a phylogenetic problem, but remains fast enough to cope with many thousands of records. By automatically assisting in the selection of the best subset of taxa to address a particular phylogenetic problem, TaxMan greatly speeds up the process of generating multiple sequence alignments for phylogenetic analysis. Our results indicate that an automated phylogenetic workbench can be a useful tool when correctly guided by user knowledge

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Edinburgh Research Explorer

Exponential distribution of long heart beat intervals during atrial fibrillation and their relevance for white noise behaviour in power spectrum

Author: A. Bollmann
A.D. Krahn
B.S. Stambler
C. Hoglund
C.J. Meurling
E.J. Benjamin
F. Gaita
F. Pinciroli
F. Pinciroli
G. Moody
J. Hayano
J. Honerkamp
J.L. Wells Jr.
Junichiro Hayano
K. Manabe
K.A. Boahene
K.T. Konings
M. Holm
M.H. Raitt
P. Bernaolo-Galván
P. Weismuller
P.A. Wolf
P.Ch. Ivanov
P.Ch. Ivanov
P.M. Verhorst
Philipp Maass
R.H. Peter
S. Nattel
S.N. Hatzido
St. Rokas
Stefan Heinrichs
T. Yamada
Thomas Hennig
W.J. Hobbs
Y. Asano
Publication venue
Publication date: 11/05/2006
Field of study

The statistical properties of heart beat intervals of 130 long-term surface electrocardiogram recordings during atrial fibrillation (AF) are investigated. We find that the distribution of interbeat intervals exhibits a characteristic exponential tail, which is absent during sinus rhythm, as tested in a corresponding control study with 72 healthy persons. The rate of the exponential decay lies in the range 3-12 Hz and shows diurnal variations. It equals, up to statistical uncertainties, the level of the previously uncovered white noise part in the power spectrum, which is also characteristic for AF. The overall statistical features can be described by decomposing the intervals into two statistically independent times, where the first one is associated with a correlated process with 1/f noise characteristics, while the second one belongs to an uncorrelated process and is responsible for the exponential tail. It is suggested to use the rate of the exponential decay as a further parameter for a better classification of AF and for the medical diagnosis. The relevance of the findings with respect to a general understanding of AF is pointed out

arXiv.org e-Print Archive

Crossref

PubMed Central